Human Parsing


Human parsing is the process of identifying, segmenting, and categorizing different parts of a human body in an image or video such as head, shoulders, knees, and toes.

PARSE: An Open-Domain Reasoning Question Answering Benchmark for Persian

Add code
Feb 01, 2026
Viaarxiv icon

MCP-Diag: A Deterministic, Protocol-Driven Architecture for AI-Native Network Diagnostics

Add code
Jan 30, 2026
Viaarxiv icon

VDE Bench: Evaluating The Capability of Image Editing Models to Modify Visual Documents

Add code
Jan 27, 2026
Viaarxiv icon

SynMind: Reducing Semantic Hallucination in fMRI-Based Image Reconstruction

Add code
Jan 25, 2026
Viaarxiv icon

DoPE: Decoy Oriented Perturbation Encapsulation Human-Readable, AI-Hostile Documents for Academic Integrity

Add code
Jan 18, 2026
Viaarxiv icon

ShowUI-Aloha: Human-Taught GUI Agent

Add code
Jan 12, 2026
Viaarxiv icon

BoxMind: Closed-loop AI strategy optimization for elite boxing validated in the 2024 Olympics

Add code
Jan 16, 2026
Viaarxiv icon

Adaptive Constraint Propagation: Scaling Structured Inference for Large Language Models via Meta-Reinforcement Learning

Add code
Jan 06, 2026
Viaarxiv icon

VIPER Strike: Defeating Visual Reasoning CAPTCHAs via Structured Vision-Language Inference

Add code
Jan 10, 2026
Viaarxiv icon

pdfQA: Diverse, Challenging, and Realistic Question Answering over PDFs

Add code
Jan 06, 2026
Viaarxiv icon